Exploring Memory Management Strategies in Catamount
نویسندگان
چکیده
In this paper, we describe how the mapping of virtual to physical memory that is set up by the operating system can have a significant and unexpected impact on the performance of STREAM [6] and a sparse solver “mini-application” that exhibits STREAM-like behavior. The Cray Performance Analysis Toolkit (CrayPAT) [2] is utilized to identify the cause of the performance degradation to be row buffer conflicts in the memory system. This is shown to be due to the internal architecture of modern commodity DRAM chips. To mitigate the performance impact of row buffer conflicts, we explore alternative strategies for constructing a process’s virtual to physical memory mapping. Ultimately, this work points out another level of locality in the memory system, i.e., DRAM row buffers, that few system software and application developers are aware of and that can have a significant impact on performance.
منابع مشابه
Lightweight Kernel Support for Direct Shared Memory Access on a Multi-Core Processor
This paper describes an enhancement to the Catamount lightweight kernel for direct shared memory access between processes running on a multi-core processor as part of a parallel application. Unlike traditional shared memory support for interprocess communication, which involves dynamic memory allocation and mapping, we leverage Catamount’s static contiguous memory mapping scheme to allows the p...
متن کاملPerformance and Memory Evaluation Using TAU
The TAU performance system is an integrated performance instrumentation, measurement, and analysis toolkit offering support for profiling and tracing modes of measurement. This paper introduces memory introspection capabilities of TAU featured on the Cray XT3 Catamount compute node kernel. TAU supports examining the memory headroom, or the amount of heap memory available, at routine entry, and ...
متن کاملSoftware Architecture of the Light Weight Kernel, Catamount
Catamount is designed to be a low overhead operating system for a parallel computing environment. Functionality is limited to the minimum set needed to run a scientific computation. The design choices and implementations will be presented.
متن کاملInvestigating the effectiveness of memory-based computer cognitive games in improving working memory and sustained attention of students with learning disability
Abstract Background and aim: Nowadays, exploring strategies to help students with learning disabilities is one of the areas of interest for many psychologists and researchers. The aim of this study was to evaluate the effectiveness of memory-based computer cognitive games in improving working memory and sustained attention of students with learning disabilities in Arak. Method: The method ...
متن کاملA Prototype Implementation of MPI for SMARTMAP
Recently the Catamount lightweight kernel was extended to support direct access shared memory between processes running on the same compute node. This extension, called SMARTMAP, allows each process read/write access to another process’ memory by extending the virtual address mapping. Simple virtual address bit manipulation can be used to access the same virtual address in a different process’ ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2008